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Description 

New secretory leader sequences. 

5 This invention relates to secretory leader sequences which can be employed to direct the secretion of a heterologous 

protein (such as human serum albumin) from fungi (for example the yeast Saccharomyces cerevisiae ). 

Translocation of protein molecules through bi-lipid membranes from one cellular compartment to another generally 
relies upon information held within the primary amino acid sequence of the protein itself. The most prevalent and therefore 
the best characterised sequence information is the amino terminal leader or signal sequence of prokaryotic and eukary- 

10 otic organisms. Genetic studies in which the signal sequence has been totally or extensively deleted indicate that the 
signal sequence is essential for protein translocation (Benson, S.A. et al. 1 985, Ann. Rev. Biochem. 54, 101-1 34). Among 
several hundred known sequences (Watson, M.E.E., 1984, Nuc. Acid. Res. _j_2, 5145-5164) no consensus signal se- 
quence or even an absolute requirement for any amino acid at any given position can be discerned, although a common 
feature of many leader sequences is a core of 7-10 hydrophobic amino acids. Genetic manipulations which result in 

15 alterations to the hydrophobic core, either by deletion or by inserting charged residues, generally result in a block in 
protein translocation (Benson ; S.A., et al. 1985, Ann. Rev. Biochem. 54, 101-134). Moreover, in a series of extensive 
modifications to the chicken lysozyme leader sequence, Yamamoto et al. 1987 (Biochem. and Biophys. Res. Comm. 
149 , 431 -436) have shown that, while some alterations to the hydrophobic core can result in the abolition of secretion ; 
others can potentiate the leader sequence function, resulting in increased levels of protein secretion. 

20 While the leader sequence is usually essential for the translocation of proteins across membranes ; once translocated 

these sequences are usually endoproteolytically cleaved by enzymes contained within the cellular compartments into 
which the proteins have now moved. These enzymes recognise specific amino acid sequences within the primary struc- 
ture of the translocated protein. Moreover, complete processing of certain eukaryotic proteins to their mature form often 
relies upon a series of proteolytic cleavages (Bussey, H., 1988 Yeast 4, 17-26). 

25 With the recent advances in recombinant DN A technology, increasing resources have been brought to bear on the 

commercial exploition of fungi, particularly yeasts, as vehicles for the production of a diverse range of proteins. 

Since many of these proteins are themselves naturally secreted products, it is possible to utilise the information 
contained within the leader sequence to direct the protein through the secretion pathway. However, this information is 
contained within a peptide foreign to yeast. Its recognition and subsequent processing by the yeast secretory pathway 

30 are not necessarily as efficient as those of a homologous yeast leader sequence. As a consequence an alternative 
approach has been to replace the leader sequence with one derived from a naturally secreted yeast protein. 

The most widely used yeast secretory sequence is the 89 amino acid leader sequence of the alpha-factor mating 
pheromone. Processing of this leader has been extensively studied (Kurjan & Herskowitz, Cell 30, 933-943, 1 982; Julius 
et a]. 1983 Cell 32, 839-852; Dmochowska et al. Cell 50, 573-584, 1987; Julius et al. Cell 36: 309-318, 1984; Julius et 

35 al. Cell 37, 1 075-1 085, 1 984) and requires at least four gene products for complete proteolytic cleavage to liberate the 
mature 1 3 amino acid alpha-factor pheromone. 

Complete proteolytic cleavage of the alpha-factor primary translation product requires first the removal of the N-ter- 
minal 1 9 amino acid signal sequence by a signal peptidase within the endoplasmic reticulum. Following this the sequen- 
tial action of three gene products located within the golgi apparatus processes the large precursor molecule, liberating 

40 four copies of the alpha-factor pheromone. These are the KEX2 gene product, an endopeptidase which cleaves after 
the Lys-Arg dibasic amino acid pair, a carboxypeptidase p-like cleavage, recently identified as the product of the KEX1 
gene, and a dipeptidyl amino peptidase, the product of the STE13 gene, which sequentially removes the Glu-Ala or 
Asp-Ala diamino acid pairing preceding the mature alpha-factor pheromone. 

The alpha factor prepro leader sequence has successfully been employed to secrete a range of diverse proteins 

45 and peptides. However, when the alpha-factor signal is used to direct secretion of human serum albumin, we have found 
that a large proportion of the extracellular HSA produced is in the form of a 45KD N-terminal fragment. 

EP-A-252 561 (Sclavo) discloses the use of the 16 amino acid signal peptide (pre-sequence) from the killer toxin 
of Kluyveromyces lactis to aid secretion of heterologous proteins in yeast. 

A further possibility is to use a fusion secretory leader sequence. This may be generated by the fusion of two 

50 independent sequences. A hybrid signal in which the first amino acids of the acid phosphatase signal were fused to the 
proteolytic cleavage site of human alpha interferon resulted in the expression and secretion of interferon (Hinnen et al. 
Foundation for Biochemical and Industrial Fermentation Research, 229 , 1219-1224, 1983); 10% of the interferon pro- 
duced was secreted into the medium. In a similar approach the first 22 amino acids of the alpha-factor leader were fused 
to the last twelve amino acids of the human interferon alpha-2 signal sequence resulting in the secretion of interferon 

55 alpha-2 into the culture supernatant (Piggott et al. Curr. Genet. J_2 561-567, 1987). An identical construct in which the 
interferon alpha-2 gene was replaced by the interferon p gene did not result in any secretion of human interferon (3 into 
the culture supernatant. Finally, in a series of experiments designed to assess the effect of leader sequences on the 
secretion of human lysozyme, Yoshimura et al. (Biochem. & Biophys. Res. Comm. 145 , 712-718, 1987) described a 
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fusion leader comprising the first 9 amino acids of the chicken lysozyme leader and the last 9 amino acids of the As- 
pergillus awamori glycoamylase leader. Although this fusion leader was effective in secreting 60% of the produced 
material into the culture supernatant, it was only 15% as effective as the entire chicken lysozyme leader. Moreover, no 
secreted product could be detected if the human lysozyme sequences were preceded by the entire Aspergillus gly- 
s coamylase leader, or a fusion derived from the first 9 amino acids of the Aspergillus glucoamylase leader and the last 
9 amino acids of the chicken lysozyme leader. 

EP-A-0 220 689 disclosed fusions of the S. cerevisiae barrier protein leader sequence and part of the barrier protein 
with proinsulin. 

We have now devised new and advantageous leader sequences for use in fungi. 
10 One aspect of the invention provides an amino acid sequence as follows: 

( a ) H 2 N-Met-Lys-Trp-Val-Ser-Phe-Ile-Ser-Leu-rieu-Phe-Leu- 
Phe-Ser-Ser-Ala-Tyr-Ser-Azrg-Ser-Leu-Asp-Lys-Axg-COOH 

15 

or 

(b) H 2 N-Met-Asn-Ile-Phe-Tyr-Ile-Phe-Leu-Phe-Leu-LeU"Ser- 

20 Phe-Val-Gln-Gly-Ser-Leu-Asp-Lys-Arg-COOH 

or conservatively modified variations of either sequence, defined as follows. 

Table 1 shows alternative amino acids for each position except the initial methionine. Any of the possible permuta- 
tions are within the scope of the invention. The selection of lysine or arginine for the last two positions is particularly 
25 non-critical, although there should always be Lys or Arg at each of these positions. Positions 20 and 21 of sequence 
(a) are not Gly and Val respectively. Sequences which are up to four amino acids shorter or longer are also included 
provided that the C-terminal Lys-Arg, Arg-Lys : Lys-Lys or Arg-Arg entity is maintained, there is a positively charged 
residue within 5 residues of the N-terminus and there is a generally hydrophobic region at or adjacent the middle of the 
sequence. The pentapeptide adjacent the C-terminal is retained. 
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Table 1 



Leader ( a ) 



Met Lys 


Trp 


Val 


Ser 


Phe 


He 


Arg 


Phe 


Leu 


Thr 


Trp 


Leu 


His 


Tyr 


He 


Gly 


Tyr 


Val 


Gin 




Met 


Ala 




Met 



Asn 



10 

Ser Leu Leu Phe Leu Phe Ser 
Thr He He Trp He Trp Thr 
Gly Val Val Tyr Val Tyr Gly 
Ala Met Met Met Ala 



Ser 


Ala 


Tyr Ser 


Arg 


Ser 


Leu 


Asp 


Lys 


Arg 


Thr 


Thr 


Phe Thr 


Lys 


Thr 


He 


Glu 


Arg 


Lys 


Gly 


Gly 


Trp Gly 


His 


Gly 


Val 


Asn 






Ala 


Ser 


Ala 


Gin 


Ala 


Met 


Gin 







Asn His 



EP0 387 319 B1 



Leader (bp 

5 





Met 


Asn 


He 


Phe 


Tyr 


He 


Phe 


Leu 


Phe 


Leu 


Leu 


Ser 


Phe 


Val 






Asp 


Leu 


Trp 


Phe 


Leu 


Trp 


He 


Trp 


He 


He 


Thr 


Trp 


Leu 


10 




Glu 


Val 


Tyr 


Trp 


Val 


Tyr 


Val 


Tyr 


Val 


Val 


Gly 


Tyr 


He 






Gin 


Met 






Met 




Met 




Met 


Met 


Ala 




Met 


15 


Gin 


His 
Gly 


Ser 


Leu 


Asp 


Lys 


Arg 
















20 


Asp 
Asn 


Ser 
Thr 


Thr 
Gly 


He 
Val 


Asn 
Glu 


Arg 


Lys 
















25 


Glu 


Ala 


Ala 


Met 


Gin 





















His His 

A second aspect provides a fusion compound comprising any of the said amino acid sequences linked, preferably 
directly, at the carboxyl terminal to the N-terminal residue of a polypeptide. The polypeptide may beany desired polypep- 

30 tide, including "pro-polypeptides" (in other words precursors which undergo post -translation a I cleavage or other modi- 
fication, such as glycosylation). The term "polypeptide" encompasses oligopeptides. The polypeptide may befibronectin 
or a portion thereof (for example the collagen or fibrin-binding portions described in EP 207 751), urokinase, pro-urok- 
inase, the 1-368 portion of CD4 (D. Smith et al (1987) Science 328, 1704-1707), platelet derived growth factor (Collins 
et a] (1985) Nature 316, 748-750) ; transforming growth factor p (Deryncket al (1985) Nature 316, 701-705), the 1-272 

35 portion of Von Willebrand's Factor (Bontham et al. Nucl. Acids Res. J_4 7125-7127), the Cathepsin D fragment of fi- 
bronectin (585-1578), a-| -antitrypsin, plasminogen activator inhibitors, factor VIII, a-globin, p-globin, myoglobin or nerve 
growth factor or a conservative variant of any of these. The polypeptide may also be a fusion of HSA or an N-terminal 
portion thereof and any other polypeptide, such as those listed above. Preferably, the polypeptide is a naturally-occurring 
human serum albumin, a modified human serum albumin or a fragment of either, such modified forms and fragments 

40 being termed "variants". These variants include all forms or fragments of HSA which fulfill at least one of the physiological 
functions of HSA and which are sufficiently similar to HSA, in terms of structure (particularly tertiary structure) as to be 
regarded by the skilled man as forms or fragments of HSA. 

In particular variants or fragments of HSA which retain at least 50% of its ligand-binding properties, for example 
with respect to bilirubin or fatty acids, (preferably 80%, or 95%) are encompassed. Such properties are discussed in 

45 Brown, J R. & Shockley, P. (1982) in Lipid-Protein Interactions^, 26-68, Ed. Jost, P.C. & Griffith, O.H. 

The portion of HSA disclosed in EP 322 094 is an example of a useful fragment of HSA which may be secreted by 
use of the leader sequences of the invention. 

A third aspect provides a nucleotide sequence coding for any of the said amino acid sequences or for the said fusion 
compound. The nucleotide sequence (or the portion thereof encoding the leader sequence) may be selected from the 

50 possibilities shown in Tables 2 & 3, for sequences (a) and (b) respectively, where the codons encoding each amino acid 
are listed under the amino acids. The codons of Tables 2 and 3 clearly relate to RNA, but it is to be understood that 
equivalent DNA nucleotide sequences are also within the scope of this aspect of the invention. 
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Table 2 



Met 


Lys 


Trp 


Val 


Ser 


Phe 


lie 


AUG 


AAA 


UGG 


GUU 


UCU 


UUU 


AUU 




AAG 




GUC 


UCC 


UUC 


AUC 








GUA 


UCA 




AUA 








GUG 


UCG 














AGU 














AGC 






Ser 


Ala 


Tyr 


Ser 


Arg 


Ser 


Leu 


UCU 


GCU 


UAU 


UCU 


CGU 


UCU 


UUA 


UCC 


GCC 


UAC 


UCC 


CGC 


UCC 


UUG 


UCA 


GCA 




UCA 


CGA 


UCA 


CUU 


UCG 


GCG 




UCG 


CGG 


UCG 


cue 


AGU 






AGU 


AGA 


AGU 


CUA 


AGC 






AGC 


AGG 


AGC 


CUG 



Ser 


Leu 


Leu 


Phe 


Leu 


Phe 


Ser 


UCU 


UUA 


UUA 


UUU 


UUA 


UUU 


UCU 


UCC 


UUG 


UUG 


UUC 


UUG 


UUC 


UCC 


UCA 


CUU 


CUU 




CUU 




UCA 




L.UC 


cue 




cue 




UCG 


AGU 


CUA 


CUA 




CUA 




AGU 


AGC 


CUG 


CUG 




CUG 




AGC 


Asp 


Lys 


Arg 










GAU 


AAA 


CGU 










GAC 


AAG 


CGC 











CGA 
CGG 
AGA 
AGG 
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Table 3 



Met 


Asn 


He 


Phe 


Tyr 


lie 


Phe 


Leu 


Phe 


Leu 


Leu 


Ser 


Phe 


Val 




AAU 




T1TTTT 

uuu 


tt a n 
UAU 


AUU 


TTTTTT 
UUU 


TTT T 7\ 

UUA 


UUU 


UUA 


UUA 


UCU 


UUU 


GUU 




AAC 


AUC 


uuc 


UAC 


AUC 


UUC 


UUG 


UUC 


UUG 


UUG 


UCC 


UUC 


GUC 






AUA 






AUA 




CUU 




CUU 


CUU 


UCA 




GUA 
















cue 




cue 


cue 


UCG 




GUG 
















CUA 




CUA 


CUA 


AGU 




















CUG 




CUG 


CUG 


AGC 






Gin 


Gly 


Ser 


Leu 


Asp 


Lys 


Arg 
















CAA 


GGU 


UCU 


UUA 


GAU 


AAA 


CGU 
















CAG 


GGC 


UCC 


UUG 


GAC 


AAG 


CGC 


















GGA 


UCA 


CUU 






CGA 


















GGG 


UCG 


cue 






CGG 

















AGU CUA AGA 

35 AGC CUG AGG 

A fourth aspect provides a DNA construct comprising a suitable control region or regions and a nucleotide sequence 
as defined above, the sequence being under the control of the control region. By "suitable control region" we mean such 
DNA regions as are necessary to enable the said nucleotide sequence to be expressed in the host for which the construct 
is intended. The control region will usually include transcriptional start and stop sequences, 3'-polyadenylation sequenc- 
40 es, a promoter and, often, an upstream activation site for the promoter. The man skilled in the art will readily be able to 
select and assemble suitable regions from those available in this art. However, specific examples of suitable expression 
vectors and their construction include those disclosed in EP 198 745, GB 2 171 703 (for B.subtilis ), EP 207 165, EP 
116 201, EP 123 244, EP 123 544, EP 147 198, EP 201 239, EP 248 637, EP 251 744, EP 258 067, EP 286 424 and 
EP 322 094. 

45 A fifth aspect provides a host transformed with the said DNA construct. The host may be any host in which the 

construct is found to work adequately, including bacteria, yeasts, filamentous fungi, insect cells, plant cells and animal 
cells. Preferably, however, the host is Saccharomyces cerevisiae or Schizosaccharomyces pombe , most preferably 
the former. As many native secretion signals are effective in heterologous hosts (for example the natural HSA leader 
sequence in yeast) it is entirely reasonable to suppose that the leader sequences of the invention will function in hosts 

50 other than yeasts. 

A sixth aspect provides a process for preparing a polypeptide, comprising cultivating the said host and obtaining 
therefrom the polypeptide expressed by the said nucleotide sequence, or a modified version thereof. 

By "modified version thereof", we mean that the actual polypeptide which is separated may have been post-trans- 
lationally modified, in particular by cleavage of the leader sequence. 
55 So that the invention may be more readily understood, preferred aspects will now be illustrated by way of example 

and with reference to the accompanying drawings in which: 

Figure 1 is a restriction map of plasmid pEK113; 
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Figure 


2 is a restriction map of 


plasmid 


pEK25; 




Figure 


Q jc a rpQtrirtion man of 


nlp^mid 

yj icioi 1 1 \\j 


DAYE230 


5 


Figure 


4 is a restriction map of 


plasmid 


pAYE238; 




Figure 
and 


5 is a restriction map of 


plasmid 


pAYE304; 


10 


Figure 


6 is a restriction map of 


plasmid 


pAYE305. 



Example of a prior art type of leader sequence 

The DNA coding sequence for mature HSA protein has been placed immediately downstream of a DNA sequence 
15 encoding the KEX2 cleavage site of the alpha factor pre pro leader sequence (85 amino acids). When this protein 
sequence is placed under the control of a promoter on a yeast autonomously replicating plasmid and transformed into 
a haploid strain of the yeast Saccharomyces cerevisiae , mature HSA can be detected in the culture supernatant. N-ter- 
minal amino acid sequence information indicates that the secreted protein has the same N-terminal amino acid compo- 
sition as natural HSA, namely Asp-Ala-His. This also indicates that the first two amino acids of the secreted HSA are 
20 not susceptible to the dipeptidyl endopeptidase, the product of the STE1 3 gene, as this enzyme is responsible for the 
removal of such sequences from between successive repeats of the alpha-factor pheromone. Although mature HSA is 
the major product observed in the culture supernatant, a N-terminal fragment of HSA (45 kilodaltons) was also detected, 
representing approximately 15% of the total HSA synthesised. This fragment component represents not only a waste 
of secretion capacity but also certain downstream purification problems in that, as a fragment of HSA, it shares some 
25 biochemical and biophysical properties with intact HSA. 

EXAMPLE 1 

We have constructed a fusion leader which may be regarded as the natural HSA leader sequence from which the 
30 last five amino acids have been removed, to be replaced by the five amino acids preceding the KEX2 cleavage site of 
the alpha-factor pre pro leader sequence, i.e. amino acids 81 to 85, Ser-Leu-Asp-Lys-Arg (Table 2). 

When transformed with suitable plasmid vectors incorporating the fusion leader, yeast secrete mature HSA into the 
culture supernatant at levels comparable to that observed with the alpha-factor leader sequence. N-terminal sequence 
analysis indicates that the mature HSA possesses the correct N-terminal amino acid composition. 
35 Moreover, substitution of the alpha-factor leader by the fusion leader sequence has been found to result in a 6 fold 

reduction in the levels of the 45 kd fragment observed in the culture supernatant. This therefore represents a significant 
improvement in the reduction of the contaminating polypeptides, thus aiding the purification of mature HSA from yeast 
culture supernatants. 

40 Details 

Unless otherwise stated all procedures were carried out as described by Maniatis et al (1982). Plasmid pEK113 
(Figure 1) (EP-A-248 637) was digested to completion with the restriction endonucleases Mst ll and Hind lll. DNA was 
recovered by phenol/chloroform extraction and ethanol precipitation. The linearised plasmid DNA was then treated with 
45 the Klenow fragment of E.coli DNA polymerase I to generate a linearised DNA molecule with blunt ends. 

The following oligonucleotide duplex (I) was constructed on an automated Applied Biosystems Inc 380B DNA syn- 
thesiser (according to manufacturer's instructions). 

Oligonucleotide I 

50 

5' 3' 
GGC TTA TAA GGA TCC TTA TAA GCC 

55 

CCG AAT ATT CCT AGG AAT ATT CGG 

The oligonucleotide duplex was ligated with equimolar quantities of linearised, blunt ended pEK113. E.coli strain 
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MC1061 was transformed with the ligation mixture and cells receiving DNA were selected on an ampicillin-containing 
medium (50ug/ml ampicillin). Recombinant plasmids containing the oligonucleotide duplex were screened by digesting 
DNA prepared from individual colonies with the restriction endonucleases Mstll and Eco RI. Plasmid pEK25 was thus 
formed (Figure 2). 

5 Plasmid pEK25 was digested to completion with the restriction endonucleases Xba l and BamHI, DNA fragments 

were separated by electrophoresis through a 1% (w/v) agarose gel and a 688 base pair Xba l - Bam HI DNA fragment 
recovered from the gel by electroelution. 

The plasmid mp19.7 (EP-A-248 637) was digested to completion with the restriction endonuclease Xho l. The line- 
arised DNA was phenol/chloroform extracted and ethanol precipitated. The recovered DNA was then treated with the 

10 Klenow fragment of E. coli DNA polymerase I as previously described, following which the DNA was phenol/chloroform 
extracted and ethanol precipitated. The recovered DNA was then digested to completion with Xba l and the digestion 
products separated by agarose gel electrophoresis. A 1067 base pair fragment was recovered from the gel by electro- 
elution. The following oligonucleotide duplex (II) was prepared as described previously. 

15 Oligonucleotide II 



GATCC ATG AAG TGG GTA AGC TTT ATT TCC CTT CTT TTT CTC 
TAC TTC ACC CAT TCG AAA TAA AGG GAA GAA AAA GAG 

25 

3' 

3o TTT AGC TCG GCT TAT TCC AGG AGC TTG GAT AAA AGA 

AAA TCG AGC CGA ATA AGG TCC TCG AAC CTA TTT TCT 

The plasmid pUC19 (Yanisch-Perron et a]. 1985) was digested to completion with the restriction endonuclease 
35 Bam HI. Linearised DNA was recovered by phenol/chloroform extraction and ethanol precipitation. 

Equimolar quantities of the Bam HI digested pUC19, the oligonucleotide duplex II, the 1067 b.p. DNA fragment 
derived from mp1 9.7 and the 688 b.p. DNA fragment derived from pEK25 were ligated together. E.coli DH5 was trans- 
formed with the ligated DNA and transformants selected on 50ug/ml ampicillin L-broth agar. Recombinant colonies 
containing the desired plasmid, designated pAYE 230 (Figure 3) were selected by digested DNA obtained from individual 
40 colonies with the restriction endonuclease Bam HI. 

Plasmid pAYE 230 was digested to completion with Bam HI and the products separated by electrophoresis through 
a 1% agarose gel. The 1832 base pair fragment containing the HSA coding sequence was recovered by electroelution. 

Plasmid pMA91 (Melloretaj. 1 983) was digested to completion with Bgl ll under standard conditions. The linearised 
plasmid was phenol/chloroform extracted and ethanol precipitated. 
45 Equivalent quantities of the linearised pMA91 and the DNA fragment prepared from pAYE 230 were ligated under 

standard conditions. E. coli DH5 was transformed with the ligation mixture and cells receiving the DNA selected on 
L-broth agar containing 50u.g/ml ampicillin. Colonies containing the desired plasmid, designating pAYE 238 (Figure 4) 
were selected by digesting the DNA from such colonies with Pvu ll. 

Plasmid pAYE 238 was transformed into the yeast Saccharomyces cerevisiae strain S150-2B as described by 
50 Hinnen et aj. (1 978). Cells receiving plasmid pAYE 238 were selected on minimal medium, supplemented with 2% (w/v) 
glucose, 20mg/l histidine, 20mg/l tryptophan and 20mg/l uracil. 

Transformed S150-2B cells were transferred to 10ml YEPD media containing 2% (w/v) glucose and incubated at 
30°C, 200rpm for 72 hours. Cell free culture supernatants were analysed by discontinuous native 8-25% gradient poly- 
acrylamide gel electrophoresis on a Pharmacia Phast System, as described in the manufacturer's instructions. Cells 
55 were stained and destained and the relative quantities of native HSA and HSA fragment estimated by gel scan at 595nm. 
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EXAMPLE 2 

We have also constructed a second fusion leader which consists of the 16 amino acid pre region of the 97,000 
dalton Kluvveromvces lactis killer (ORF 2) toxin (Stark and Boyd, 1986, Tokumaga et a| 1987) fused to the five amino 
5 acids preceding the KEX2 cleavage site of the alpha-factor prepro leader sequence, i.e. amino acids 81 to 85 ; 
Ser-Leu-Asp-Lys-Arg (Table 3). 

When transformed with plasmid vectors incorporating the fusion leader described in Table 3, yeast secreted mature 
HSA into the culture supernatants at levels higherthan when eitherthe natural K. lactis prepro killer toxin leadersequence 
or the alpha-factor prepro leader sequence was used. N-terminal sequence analysis indicates that the mature HSA 
10 possesses the correct N-terminal amino acid composition. 

Substitution of the alpha-factor leader by the K.lactis killer/alpha factor fusion leader sequence resulted in a six fold 
reduction in the levels of the 45kd fragment observed in the culture supernatant. This therefore represents a significant 
improvement in the reduction of the contaminating polypeptides, thus aiding the purification of mature HSA from yeast 
culture supernatants. 

15 

Details 

The experimental procedures employed to generate a yeast HSA secretion vector utilising the K.lactis killer/alpha 
factor fusion leader were identical to those described in Example 1 , except that oligonucleotide duplex (II) was replaced 
20 by oligonucleotide duplex (III) synthesised on an automated Applied Biosystems Inc. 380B DNA synthesiser (according 
to manufacturer's instructions). 

Oligonucleotide duplex III 

25 

GATCC ATG AAT ATA TTT TAC ATA TTT TTG TTT TTG CTG TCA TTC 
TAC TTA TAT AAA ATG TAT AAA AAC AAA AAC GAC AGT AAG 

30 

GTT CAA GGA AGC TTG GAT AAA AGA 

3s CAA GTT CCT TCG AAC CTA TTT TCT 

Equimolar quantities of the BamHI digested pUC1 9, the oligonucleotide duplex III, the 1 067bp DNAfragment derived 
from mp19.7 and the 688b. p. DNA fragment derived from pEK25 were ligated together. E.coli DH5 was transformed 
with ligated DNA and transformants selected on 50jug/ml ampicillin L-broth agar. Recombinant colonies containing the 
desired plasmid : designated pAYE304 (Figure 5), were selected by digested DNA obtained from individual colonies with 
40 the restriction endonuclease Bam HI. 

Plasmid pAYE304 was digested to completion with BamHI and the products separated by electrophoresis through 
a 1% agarose gel. The 1823 base pair fragment containing the HSA coding sequence was recovered by electroelution. 

Plasmid pMA91 (MelloretaJ, 1 983) was digested to completion with Bgl ll under standard conditions. The linearised 
plasmid was phenol/chloroform extracted and ethanol precipitated. 
45 Equivalent quantities of the linearised pMA91 and the DNA fragment prepared from pAYE304 were ligated under 

standard conditions. E.coli DH5 was transformed with the ligation mixture and cells receiving DNA selected on L-broth 
agar containing 50|Lig/iml ampicillin. Colonies containing the desired plasmid, designating pAYE305 (Figure 6), were 
selected by digesting the DNA from such colonies with Pvu ll. 

Plasmid pAYE305 was transformed into the yeast Saccharomyces cerevisiae strain S150-2B as described by 
50 Hinnen et al, (1978). Cells receiving plasmid pAYE305 were selected on minimal medium, supplemented with 2% (w/v) 
glucose, 20mg/l histidine, 20mg/l tryptophan and 20mg/l uracil. 

Transformed S150-2B cells were transferred to 10ml YEPD medium containing 2% (w/v) glucose and incubated at 
30°C, 200rpm for 72 hours. Cell free culture supernatants were analysed by discontinuous native 8-25% gradient poly- 
acrylamide gel electrophoresis on a Pharmacia Phast System, as described in the manufacturer's instructions. 
55 Cells were stained and destained and the relative quantities of native HSA and HSA fragment estimated by gel scan 

at 595nm. 



10 



EP0 387 319 B1 



EXAMPLE 3 

Using a vector based on the disintegration vectors of EP286424 (Delta Biotechnology), a suitable promoter and the 
fusion leader of Example 1 above, Schizosaccharomyces pombe (strain J_eul. 32 h) was transformed and fermented at 
5 30°C in 10ml of EMM (Edinburgh minimal medium, Ogden, J.E. & Fantes ; P.A. (1986) Curr. Genetics J_0 509-514) ; 
buffered to pH 5.6 with 0.1M citric acid/sodium phosphate, to give 10-15 mg/l of HSA in the culture supernatant after 3 
days. 
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Claims 

35 1. A polypeptide having an amino acid sequence as follows: 
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(a) 



1 10 

Met Lys Trp Val Ser Phe lie Ser Leu Leu Phe Leu Phe Ser 
Arg Phe Leu Thr Trp Leu Thr lie lie Trp lie Trp Thr 
His Tyr lie Gly Tyr Val Gly Val Val Tyr Val Tyr Gly 
Gin Met Ala Met Ala Met Met Met Ala 

Asn 



20 

Ser Ala Tyr Ser Arg Ser Leu Asp Lys Arg 
Thr Thr Phe Thr Lys Thr lie Glu Arg Lys 
Gly Gly Trp Gly His Gly Val Asn 
Ala Ser Ala Gin Ala Met Gin 

Asn His 

or 



Met Asn lie Phe Tyr lie Phe 
Asp Leu Trp Phe Leu Trp 
Glu Val Tyr Trp Val Tyr 
Gin Met Met 
His 



Leu Phe Leu Leu Ser Phe Val 
lie Trp lie lie Thr Trp Leu 
Val Tyr Val Val Gly Tyr lie 
Met Met Met Ala Met 



Gin Gly Ser Leu Asp Lys Arg 
Asp Ser Thr lie Asn Arg Lys 
Asn Thr Gly Val Glu 
Glu Ala Ala Met Gin 
His His 

wherein vertically aligned groups of amino acids represent alternative amino acids at the position shown, and 
positions 20 and 21 of sequence (a) are not Gly and Val respectively, or a polypeptide which is up to four amino 
acids shorter or longer 

provided (i) that the C-terminal Lys-Arg, Arg-Lys, Lys-Lys or Arg-Arg entity is maintained, (ii) that there is a 
positively charged residue within 5 residues of the N-terminus, (iii) that there is a generally hydrophobic region 
at or adjacent the middle of the sequence, 

and (iv) that the three amino acids N-terminal to the C-terminal Lys/Arg, Lys/Arg entity are maintained. 



A polypeptide according to claim 1 having an amino acid sequence as follows: 
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5 1 10 

Met Lys Trp Val Ser Phe lie Ser Leu Leu Phe Leu Phe Ser 

Arg Phe Leu Thr Trp Leu Thr lie lie Trp lie Trp Thr 
10 His Tyr lie Gly Tyr Val Gly Val Val Tyr Val Tyr Gly 

Gin Met Ala Met Ala Met Met Met Ala 

Asn 



15 



20 



25 or 



20 

Ser Ala Tyr Ser Arg Ser Leu Asp Lys Arg 

Thr Thr Phe Thr Lys Thr lie Glu Arg Lys 

Gly Gly Trp Gly His Gly Val Asn 

Ala Ser Ala Gin Ala Met Gin 

Asn His 



30 


Met Asn 


lie 


Phe 


Tyr 


lie 


Phe 


Leu 


Phe 


Leu 


Leu 


Ser 


Phe 


Val 




Asp 


Leu 


Trp 


Phe 


Leu 


Trp 


lie 


Trp 


lie 


lie 


Thr 


Trp 


Leu 




Glu 


Val 


Tyr 


Trp 


Val 


Tyr 


Val 


Tyr 


Val 


Val 


Gly 


Tyr 


lie 


35 


Gin 
His 


Met 






Met 




Met 




Met 


Met 


Ala 




Met 



Gin 


Gly 


Ser 


Leu 


Asp 


Lys 


Arg 


Asp 


Ser 


Thr 


lie 


Asn 


Arg 


Lys 


Asn 


Thr 


Gly 


Val 


Glu 






Glu 


Ala 


Ala 


Met 


Gin 







45 His His 

wherein vertically aligned groups of amino acids represent alternative amino acids at the position shown, or a 
polypeptide which is up to four amino acids longer 

50 

provided that positions 20 and 21 of sequence (a) are not Gly and Val respectively. 
3. A polypeptide according to Claim I or 2 and having an amino acid sequence: 

(a) H 2 N-Met-Lys-Trp-Val-Ser-Phe-Ile-Ser-Leu-Leu-Phe-Leu- 
Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys-Arg-COOH 
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or 

(b) H 2 N-Met-Asn-Ile-Phe-Tyr-Ile-Phe-Leu-Phe-Leu-LGU-Ser- 
Phe-Val-Gln-Gly-Ser-Leu-Asp-Lys-Arg-COOH 

4. A fusion compound comprising a polypeptide according to Claim 1 , 2 or 3 linked at the carboxyl terminal to the 
N-terminal residue of a second polypeptide. 

10 5. A fusion compound according to Claim 4 wherein the polypeptide of Claim 1 or 2 is linked directly to said second 
polypeptide. 

6. A fusion compound according to Claim 5 wherein the second polypeptide is a naturally-occurring human serum 
albumin (HSA), a modified human serum albumin having a tertiary structure similar to that of HSA and having at 

15 least one physiological function in common with HSA, or a fragment of either. 

7. A polynucleotide sequence encoding a polypeptide according to Claim 1 , 2 or 3 or a fusion compound according 
to Claim 4. 

20 8. A polynucleotide according to Claim 7 selected from the sequences shown in Tables 2 and 3. 

9. A DNA construct comprising a suitable control region or regions and a polynucleotide according to Claim 7 or 8 : 
said polynucleotide being under the control of the control region. 

25 10. A host transformed with a DNA construct according to Claim 9. 

1 1 . Saccharomvces cerevisiae or Schizosaccharomyces pombe according to Claim 1 0. 

12. A process for preparing a polypeptide, comprising cultivating a host according to Claim 10 or 11 and obtaining 
30 therefrom the polypeptide expressed by the said polynucleotide or a modified version of the polypeptide. 



Patentanspruche 

35 1. Polypeptid mit einer Aminosauresequenz wie folgt: 
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10 



Met 


Lys 


Trp 


Val 


Ser 


Phe 


He 


Ser 


Leu 


Leu 


Phe 


Leu 


Phe 


Ser 




Arg 


Phe 


Leu 


Thr 


Trp 


Leu 


Thr 


He 


He 


Trp 


He 


Trp 


Thr 




His 


Tyr 


lie 


Gly 


Tyr 


Val 


Gly 


Val 


Val 


Tyr 


Val 


Tyr 


Gly 




Gin 




Met 


Ala 




Met 


Ala 


Met 


Met 




Met 




Ala 




Asn 




































20 


















Ser 


Ala 


Tyr 


Ser 


Arg 


Ser 


Leu 


Asp 


Lys 


Arg 










Thr 


Thr 


Phe 


Thr 


Lys 


Thr 


He 


Glu 


Arg 


Lys 










Gly 


Gly 


Trp 


Gly 


His 


Gly 


Val 


Asn 














Ala 


Ser 




Ala 


Gin 


Ala 


Met 


Gin 






















Asn 






His 














oder 






























(b) 
























Met 


Asn 


lie 


Phe 


Tyr 


He 


Phe 


Leu 


Phe 


Leu 


Leu 


Ser 


Phe 


Val 




Asp 


Leu 


Trp 


Phe 


Leu 


Trp 


He 


Trp 


He 


He 


Thr 


Trp 


Leu 




Glu 


Val 


Tyr 


Trp 


Val 


Tyr 


Val 


Tyr 


Val 


Val 


Gly 


Tyr 


He 




Gin 


Met 






Met 




Met 




Met 


Met 


Ala 




Met 



His 



Gin Gly Ser Leu Asp 

Asp Ser Thr He Asn 

Asn Thr Gly Val Glu 

Glu Ala Ala Met Gin 

His His 

wobei vertikal ausgerichtete Gruppen von Aminosauren fur alternative Aminosauren an der angegebenen Stelle 
stehen und die Stellen 20 bzw. 21 der Sequenz (a) nicht von Gly bzw. Val eingenommen werden, oder ein Polypeptid ; 
welches bis zu vier Aminosauren kurzer oder langer ist, wobei gilt, da(3 

(i) die C-terminale Lys-Arg-, Arg-Lys-, Lys-Lys- oder Arg-Arg-Einheit erhalten ist, 

(ii) es einen positiv geladenen Rest innerhalb von funf Resten des N-Terminus gibt, 

(iii) ein im allgemeinen hydrophober Bereich in oder nachst der Mitte der Sequenz vorhanden ist und 

(iv) die drei Aminosauren N-terminal zu der C-terminalen Lys/Arg, Lys/Arg-Einheit erhalten sind. 



Lys Arg 
Arg Lys 



Polypeptid nach Anspruch 1 mit einer Aminosauresequenz wie folgt: 
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JaJ_ 

1 10 

Met Lys Trp Val Ser Phe lie Ser Leu Leu Phe Leu Phe Ser 
Arg Phe Leu Thr Trp Leu Thr lie lie Trp lie Trp Thr 
His Tyr lie Gly Tyr Val Gly Val Val Tyr Val Tyr Gly 
Gin Met Ala Met Ala Met Met Met Ala 



Oder 



Asn 



Ser Ala Tyr Ser Arg Ser Leu Asp Lys Arg 

Thr Thr Phe Thr Lys Thr He Glu Arg Lys 

Gly Gly Trp Gly His Gly Val Asn 

Ala Ser Ala Gin Ala Met Gin 

Asn His 



_QlL 

Met Asn He Phe Tyr lie Phe Leu Phe Leu Leu Ser Phe Val 

Asp Leu Trp Phe Leu Trp He Trp lie lie Thr Trp Leu 

Glu Val Tyr Trp Val Tyr Val Tyr Val Val Gly Tyr lie 

Gin Met Met Met Met Met Ala Met 

His. 



Gin Gly Ser Leu Asp Lys Arg 
Asp Ser Thr He Asn Arg Lys 
Asn Thr Gly Val Glu 
Glu Ala Ala Met Gin 
His His 

wobei vertikal ausgerichtete Gruppen von Aminosauren fur alternative Aminosauren an der angegebenen Stelle 
stehen, oder ein Polypeptid, welches bis zu vier Aminosauren langer ist, wobei gilt, daB die Stellungen 20 bzw. 21 
der. Sequenz (a) nicht von Gly bzw. Val eingenommen sind. 

Polypeptid nach Anspruch 1 oder 2 mit einer Aminosauresequenz: 

(a) H 2 N-Met-Lys-Trp-Val-Ser-Phe-Ile-Ser-Leu-Leu-Phe-Leu- 
Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys-Arg-COOH 

oder 

(b) H 2 N-Met-Asn-Ile-Phe-Tyr-Ile-Phe-Leu-Phe-Leu-Leu-Ser- 
Phe-Val-Gln-Gly-Ser-Leu-Asp-Lys-Arg-COOH 
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4. Fusionsverbindung, umfassend ein Polypeptid nach Anspruch 1 , 2 oder 3, welches am Carboxylende an den N-ter- 
minalen Rest eines zweiten Polypeptids gebunden ist. 

5. Fusionsverbindung nach Anspruch 4, wobei das Polypeptid nach Anspruch 1 oder 2 direkt an das zweite Polypeptid 
5 gebunden ist. 

6. Fusionsverbindung nach Anspruch 5, wobei das zweite Polypeptid aus naturlich vorkommendem Humanserumal- 
bumin (HSA) ; einem modifizierten Humanserumalbumin mit tertiarer Struktur ahnlich derjenigen von HSA und min- 
destens einer physiologischen Funktion, die mit HSAgemeinsam ist, oder einem Fragment eines derselben besteht. 

10 

7. Polynucleotidsequenz mit Codierung fur ein Polypeptid nach Anspruch 1 , 2 oder 3 oder eine Fusionsverbindung 
nach Anspruch 4. 

8. Polynucleotid nach Anspruch 7, ausgewahlt aus den in Tabellen 2 und 3 dargestellten Sequenzen. 

15 

9. DNA-Konstrukt, umfassend (einen) geeignete(n) Steuerbereich oder -bereiche und ein Polynucleotid nach 
Anspruch 7 oder 8, wobei das Polynucleotid unter der Kontrolle des Steuerbereichs steht. 

10. Mit einem DNA-Konstrukt nach Anspruch 9 transformierter Wirt. 

20 

11. Saccharomyces cerevisiae oder Schizosaccharomyces pombe nach Anspruch 10. 



25 



30 



12. Verfahren zur Herstellung eines Polypeptids durch Zuchten eines Wirts nach Anspruch 10 oder 11 und Gewinnen 
des durch das Polynucleotid exprimierten Polypeptids oder einer modifizierten Version des Polypeptids aus diesem. 

Revendications 

1. Polypeptide ayant une sequence d'acides amines comme suit : 



1 10 

35 





Met Lys 


Trp 


Val 


Ser 


Phe 


He 


Ser 


Leu 


Leu 


Phe 


Leu 


Phe 


Ser 




Arg 


Phe 


Leu 


Thr 


Trp 


Leu 


Thr 


He 


He 


Trp 


He 


Trp 


Thr 




His 


Tyr 


He 


Gly 


Tyr 


Val 


Gly 


Val 


Val 


Tyr 


Val 


Tyr 


Gly 


40 


Gin 




Met 


Ala 




Met 


Ala 


Met 


Met 




Met 




Ala 



Asn 





Ser 


Ala 


Tyr 


Ser 




Thr 


Thr 


Phe 


Thr 


50 


Gly 


Gly 


Trp 


Gly 




Ala 


Ser 




Ala 



ou 

55 



Arg 


Ser 


Leu 


Asp 


Lys 


Arg 


Lys 


Thr 


He 


Glu 


Arg 


Lys 


His 


Gly 


Val 


Asn 






Gin 


Ala 


Met 


Gin 






Asn 






His 
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Met Asn 


He 


Phe 


Tyr 


He 


Phe 


Leu 


Phe 


Leu 


Leu 


Ser 


Phe 


Val 


Asp 


Leu 


Trp 


Phe 


Leu 


Trp 


He 


Trp 


He 


He 


Thr 


Trp 


Leu 


Glu 


Val 


Tyr 


Trp 


Val 


Tyr 


Val 


Tyr 


Val 


Val 


Gly 


Tyr 


He 


Gin 


Met 






Met 




Met 




Met 


Met 


Ala 




Met 



His 



15 


Gin 


Gly 


Ser 


Leu 


Asp 


Lys 


Arg 




Asp 


Ser 


Thr 


He 


Asn 


Arg 


Lys 




Asn 


Thr 


Gly 


Val 


Glu 






20 


Glu 


Ala 


Ala 


Met 


Gin 






His 








His 







dans laquelle les groupes d'acides amines alignes a la verticale represented des acides amines alternatifs a la 
position indiquee et les positions 20 et 21 de la sequence (a) ne sont ni Gly ni Val, respectivement, ou un polypeptide 
25 qui compte jusqu'a quatre acides amines de moins ou de plus, a condition (i) que I'entite C-terminale Lys-Arg ; 

Arg-Lys, Lys-Lys ou Arg-Arg soit maintenue, (ii) qu'il y ait un residu charge positivement parmi les 5 residus de 
I'extremite N-terminale, (iii) qu'il y ait une region generalement hydrophobe au milieu de la sequence ou dans une 
position adjacente, et (iv) que les trois acides amines N-terminaux jusqu'a I'entite C-terminale Lys/Arg, Lys/Arg soit 
maintenus. 

30 

2. Polypeptide selon la revendication 1 ayant une sequence d'acides amines comme suit : 

35 

1 10 



Met Lys 


Trp 


Val 


Ser 


Phe 


He 


Ser 


Leu 


Leu 


Phe 


Leu 


Phe 


Ser 


Arg 


Phe 


Leu 


Thr 


Trp 


Leu 


Thr 


He 


He 


Trp 


He 


Trp 


Thr 


His 


Tyr 


He 


Gly 


Tyr 


Val 


Gly 


Val 


Val 


Tyr 


Val 


Tyr 


Gly 


Gin 




Met 


Ala 




Met 


Ala 


Met 


Met 




Met 




Ala 



Asn 

45 

20 





Ser 


Ala 


Tyr 


Ser 


Arg 


Ser 


Leu 


Asp 


Lys 


Arg 


50 


Thr 


Thr 


Phe 


Thr 


Lys 


Thr 


He 


Glu 


Arg 


Lys 




Gly 


Gly 


Trp 


Gly 


His 


Gly 


Val 


Asn 








Ala 


Ser 




Ala 


Gin 


Ala 


Met 


Gin 







Asn His 

ou 
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an 





J\ on 


Tin 
lie 


±r lie 


iyr 


Tin 

i. ±e 


rne 


Leu 


T5 V» 

irne 


Leu 


Leu 


Ser 


Fne 


Val 




Asp 


Leu 


Trp 


T-V V> ^-v 

Pxie 


Leu 


Trp 


He 


Trp 


I le 


He 


Thr 


Trp 


Leu 




Glu 


Val 


Tyr 


Trp 


Val 


Tyr 


Val 


Tyr 


Val 


Val 


Gly 


Tyr 


He 




Gin 


Met 






Met 




Met 




Met 


Met 


Ala 




Met 




His 


























Gin 


Gly 


Ser 


Leu 


Asp 


Lys 


Arg 
















Asp 


Ser 


Thr 


He 


Asn 


Arg 


Lys 
















Asn 


Thr 


Gly 


Val 


Glu 




















Glu 


Ala 


Ala 


Met 


Gin 





















His His 

dans laquelle les groupes d'acides amines alignes a la verticale represented des acides amines alternatifs a la 
position indiquee, ou un polypeptide qui compte jusqu'a quatre acides amines de plus, a condition que les positions 
20 et 21 de la sequence (a) ne soient ni Gly ni Val, respectivement. 

Polypeptide selon la revendication 1 ou 2 et ayant une sequence d'acides amines : 

(a) H2N-Met-Lys-Trp-Val-Ser-Phe-He-Ser-Leu-Leu-Phe-Leu- 

Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys-Arg-COOH 

ou 

( b ) H^N— Met— Asn — I le— Phe— Tyr— I le— Phe— Leu— Phe— Leu— Leu— 

Ser-Phe-Val-Gln-Gly-Ser-Leu-Asp-Lys-Arg-COOH 

Compose de fusion comprenant un polypeptide selon la revendication 1 , 2 ou 3 lie au niveau de I'extremite carboxyle 
au residu N-terminal d'un deuxieme polypeptide. 

Compose de fusion selon la revendication 4, dans lequel le polypeptide selon la revendication 1 ou 2 est directement 
lie audit deuxieme polypeptide. 

Compose de fusion selon la revendication 5, dans lequel le deuxieme polypeptide est une serum-albumine humaine 
naturelle (SAH), une serum-albumine humaine modifiee ayant une structure tertiaire similaire a celle de la SAH et 
ayant au moins une fonction physiologique en commun avec la SAH ; ou un fragment de Tune ou I'autre. 

Sequence polynucleotidique codant un polypeptide selon la revendication 1 , 2 ou 3ou compose de fusion selon la 
revendication 4. 

Polynucleotide selon la revendication 7 choisi parmi les sequences indiquees dans les Tableaux 2 et 3. 

Fragment d'ADN comprenant une ou des regions de commande appropriees et un polynucleotide selon la reven- 
dication 7 ou 8, ledit polynucleotide etant sous la commande de la region de commande. 
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Hote transforme avec un fragment d'ADN selon la revendication 9. 

Saccharomyces cerevisiae ou Schizosaccharomyces pombe selon la revendication 10. 

Procede de preparation d'un polypeptide, consistant a cultiver un hote selon la revendication 10 ou 11 et a obtenir 
de celui-ci le polypeptide exprime par ledit polynucleotide ou une version modifiee du polypeptide. 
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p BR 322 DNA 
MET-HSA DNA 
Restriction endonuc lease sites 



Fig. 1 
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Plasmid pEK 25 



Eco R1 




Fig. 2 
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Plasmid pAYE 230 




Bam HI 
Xbal 



Fig. 3 
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Plasmid pAYE 238 



Eco Rl 




Fig. 4 
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Plasmid pAYE 304 




Bam HI 
Xbal 



Fig. 5 
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Plasmid pAYE 305 



Eco RI 



EcoRI 




Fig. 6 
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